
From non-convexity to convexity: A 
game theoretic bridge 



In general, non-convex non-separable non-differentiable 
problems is known to be hard 

We start from separable smooth (continuously 
differentiable) non-convex functions with convex constraint 

Why non-convex is hard: Lagrangian-dual is used to 
convert a constrained problem to an unconstrained one, 
however non-convexity leads to duality gap; even 
unconstrained, multiple local extremes leads to sub- 
optimality 

Whether we can find a convex problem such that the 
optimizer of the non-convex one is consistent with the 
optimizer of the corresponding convex one? 



Review of potential game theory 



A Game G =< MAu } A/r ATl } „ > (M is the user set, 

^ m is the local objective for user m, and Il m is the constraint 
set for user m) is a potential game if there is a function 

0 : n = ®n m -> 9? satisfying 

i.e., —o(x) = — u m (xjym when (J) is smooth. 0 is the 
corresponding potential function. 

When 0 is concave ad smooth, and IT is convex, the set of 
maximums of 0 is consistent with the set of equilibriums 
of G 



First try 



If we can construct a potential game with a strictly concave 
potential, and the global optimizer of the non-concave 
problem is an equilibrium of the potential game. Then this 
global optimizer is consistent with the maximum of the 
concave potential function 

If we let the global objective be the local objective for all 
users, then the global optimizer is an equilibrium of this 
game. However, this game does not have concave potential 

Operations (e.g., change of variables) that preserve the 
nature of the problem can not convert a con-convex 
problem to convex. Need to introduce auxiliary variable 



A class of non-convex functions 



Optimization Problem: Hiax / (x) 

Variable X — y JC^ ? x^ 5 • • • 5 X j ) is an L dimensional vector 

Constraint Y\ x is convex 

The objective f(x) is bridgeable over the constraint if 

• Continuously differentiable (smooth) 

• Positive inherently separable 

• Seemingly strictly concave 



Inherently separable 



A function f(x) is inherently separable if it can be written 
as t 



s- ^ [A] 



1=1 




K 




K = [A]x , where det[ A] * 0, i.e., x ^ is linear 
injection . Thus, > constraint of f[ , is also convex, 
and can be viewed as the new basis 

<Pi are arbitrary differentiable functions. We call Pi 
"separating functions" 



Separable ->inherently separable 



n 



k y 1 



Positive inherently separable 



A function f(x) is positive inherently separable if it is 
inherently separable and the separating functions satisfy 
(strictly positive inherently separable if the inequality holds 
strictly) : , 

0 < — (p t {K t ) < 1, V/, VK t G [c* , c k im ] 
dK { lM 

Let Qi = <PA K i), then K t = q>\ (<2,). By applying 
derivative of inverse function, d _ t 1 



g>, (a ) = 

the above condition is equiv to Jg, ^'[^ ((^ )] 

dQ, 



Seemingly concave 



If f(x) is positive inherently separable, i.e., 

/(*) = Z/z(*i -^/(^/)). * = det[A] * 0; 

Z=l 

o<^(ir ; )<i,v/,vlGn„ 

and if //(•), V/ are (strictly) concave, then f(x) is 
seemingly (strictly) concave 



Examples 



(x 1 + 2x 2 - e Xl+2x2 ) 2 + (x x + x 2 - e Xl+Xl ) 2 , x e 91 is 

inherently separable but not positive inherently separable 

logfl + Xj -log(l + X x )] + log[l + X 2 -log(l + X 2 )],X E 91 
is separable, then inherently separable; and it is also strictly 

positive inherently separable, and seemingly strictly 

concave 

f(x) = (1 + e ~ x ) , X E 91 (sigmoid) is positive inherently 
separable and seemingly strictly concave: 

+ e -*y l = l 0 g{K -[K- e (l+e ~ K/Myl ]},K = Mx, <p(K) = K-e f 
(p' = l-—f'e f . This function, /,/' are bounded and / > 0 , 
so it is possible to find M 



Constructing potential game 



L 



Rewrite = -0* (*,)), 

Game 1=1 



L 

^ U 



maxJ^f l (K l -Q l ) = „ K 



Q 1=1 



m^xf j [f l (K l -Q l ) + g l (Q l )] = u Q 



Q are auxiliary variables 




Constructing potential game (Cont.) 



Auxiliary variable Q is unconstrained, necessary condition 



d f j [f l (K l -Q l ) + g l (Q l )] = Oyi 



and require Q\ — <Pi \K l ) to always be the solution 
. Thismeans + 

dQi 1=1 

= ^rlf l (K l -Q l ) + g l (Q l )] 

require strictly concavity of ' ^ ' rrw n 

f so that V is not constant = ~fiW (Qi ) ~ Qi J + 81 (Qi ) = 0 



Constructing potential game (Cont.) 



Thus, for Vg/ etc^cj^], 

/7s strictly concave, positive inherently separable leads to 
concavity of g, and strictly positive inherently separable leads 
to strictly concavity of g 

Lemma: the global optimizer of bridgeable functions over a 
convex set is an equilibrium of the constructed potential 
game 



Proof outline of the lemma 



Given optimizer x* , \et(K ,Q ) denote the corresponding 
point in the game. Suppose it is not an equilibrium, then 

K is not the optimizer of M * = f (f[ _ Q \ under 
constraint 11^ /=1 

Let (K e , Q e ) be the equilibrium, this means f[ e is the 
optimizer of e ^ tf under n * 

/=i 

Note 



Proof outline of the lemma (Cont.) 




Note 0 < — — = — — <p{K) < 1 we will reach a contradiction 

dK dK 

f(K*)<f(K e ) 



Bridge from non-convex to convex 



Theorem: If f(x) is bridgeable, by writing f(x) as 

/(*) = f J f l (K l -<p l (K l )),K = [A]x. and lettin g 
®(K,Q) = j^[f l (K l -Q l ) + g^Q,)] , where 

1=1 

ft _ 

8, (Qi ) = fi VP~ X (Qi)-Qi\ , then ®(K,Q) is concave. 
Let {K m ,O m } be the set of optimizer for max 0(K,Q) 

1 J 1 — /TerU,£e(x)[c G ,c G ] 

and let M = {x m : 3c* = [A]" 1 /T }. Then/the global 
optimizer of max f(x) where II convex, belongs to M. 

xeU x 




Bridge from non-convex to convex 
(Cont.) 



ft 



The theorem can be proved using the previous arguments 
and lemma, combined with the potential game theory 

Corollary: lff(x) is strictly positive inherently separable, 
i.e., the separating functions satisfy §<(p[< 1, V/, then 

(J)( K, Q) i s strictly concave, and the global optimizer of 

fix) is mapped to the unique optimizer of 0(A", Q) 

The strictly concavity can be verified from the following 
Hessian 



ft 



ft 



ft 



-f, {K l -Q l )f l (K l -Q l ) + g l (a) 



0 



ft 



(Q,) 



Handling non-strictly concavity 



If 0(A", Q) is not strictly concave, from the previous 
discussion, this non-strictly concavity is due to g l \Q l ) , by 

adding quadratic terms _ V -Q) 2 ' we can handle it 
by using proximal algorithm /=1 

The bridging method is not operations that preserves the 
nature of an optimization problem. It introduces auxiliary 
variables and the potential function is not an 
transformation from the original objective 

Obtaining the explicit expression of g l {Q x ) from 

gi (Qi) = // W\Qi) ~ Qi\ and <Pi~ ( # ) can sometimes 
be difficult, do we need to get g (Q ) and (p { (•) in the 

algorithm? Fortunately, no 



Implementation 



Without loss of generality, let 11^ = {K : [C]K < c M } 
which satisfies the Slater s condition, then the Lagrangian 

L(K,Q,X) = ®(K,Q)-A r ([C]K-c M l 

In the dual algorithm, at each iteration m, given Ji (m) > 0, 
need to solve _ _ _ 

mdLx{®(K,Q)-£[C]K} 
[®(K,Q) -F[C]K}= -f [K.-Q.] + g (Q.) = 0 



dQ l 




Implementation (Co 



L 



dK 



I 1=1 



K (m),temp Qan be ca l cu l ate( J 

using Newton's method without 
knowing the expression of £/(•) and (p *(•) 

Even when has extra simple constraint C l m < K t < C { M , 



due to concavity: 



(m),temp 
I 



<c 



Lm 



< 



C 



, if £ 



(m),temp 



LM > I 



>C 



l,m 



K 



V 



(m),temp 
I 



, o/w 



J Illustration of brid ging 



f(x) = -[x - log(l + x) - If, x > 0 
®(K,Q) = -[K-Q-lf+Q 2 +4Q 
Q = <P(K) = log(l + K) ^ 




0 1 2 3 4 5 0 0 




How large the class of functions is 



Recall that the class of functions we are interested (define as 
bridgeable functions) in has the following properties: 

• Continuously differentiable (smooth) 

• Positive inherently separable 

• Seemingly strictly concave 

It is apparent that inherently separable concave functions 
are included 

From previous examples, we observe that some sigmoid 
functions belong to this class, the first question is whether 
the whole sigmoid class is contained? 



Monotone and Sigmoid functions 



Some properties of sigmoid functions f(x) : 
• "S" or opposite "S" shaped: either f\x) > 0 or f\x) < 0 



Bounded: 



< c 2 < oo 



f{x) <q<oo and f\x) 
• Have one inflection point 

Proposition: The class of inherently separable monotone 
functions with bounded function value and bounded first 
derivative belongs to the class of bridgeable functions. 
Strictly monotone leads to strictly positive inherently 
separable. 

Corollary: The class of inherently separable sigmoid 
functions belongs to the class of bridgeable functions 



Proof outline of the proposition 



Note / (x) = log {x - [x - e J w ] } = log {-jc - [-x - e J (x> ] } 

• If f'(x)>0, let K = Mx,M >0, then 

f(K) = \og{K -[K- e f(K,M) ]},(p(K) = K- e f{KIM) 
<p' = l- f'e f I M ■ Choose M so that (p' e [0,1] 
. If f'(x)<0,\etK = -Mx,M >0,then 

/(AT) = log{/^ -[K- e f( - KIM) ]},<p(K) = K- e f( ~ KIM) 
<p' = \ + f' e f I M < 1. Choose M so that G [0,1] 

• Sigmoid function optimization under convex constraints 
is a non-convex problem in nature. This tells us that the 
game theoretic bridging is unlike change of variables etc, 
that preserves the nature of the function 




More than monotone and sigmoid 
functions 



Allow multiple inflection points: 



fix) 



X 



Allow unbounded value and 
unbounded first derivative , do not need to be monotone 

f(x) = -[x - log(l + x) - 1] 2 , X > 0 
Q = (p{K) = log(l + K) 

K = x>0 

/(•) = -[• "If 



-0.5 



-1.5 



>< -2.5 




An outer bound for bridgeable 
functions 



• Theorem: A bridgeable function has at most one local 
maxima 

• Proof outline: all bridgeable function optimization under 
convex constraint can be bridged to a concave potential 
function optimization under the corresponding convex 
constraint; potential objective construction does not use 
constraint information ^^—^ 



f(K x ) 
f(K*) 




t X* K 



Unknown gap 



Example: f(x) = exp[log(x) - x],X > 0 

/'>0,x€[0,l];/'<0,xeM 

This function has only one local maxima, however, it is not 
bridgeable over the whole constraint 

Bridgeable or not is highly related to the constraint set, so it 
should be evaluated under the constraint. Recall that 
®K m ^J is the region in which 0<^^(K t )<iyi ^ Thus, 
if cz ° , then/( x ) is bridgeable over U xy 

otherwise, it is only bridgeable over a subset mapped from 

®[c K ,c K ] 

^ l,m I ,M 




Illustration of the bridgeable class 



nherently non-convex Bridgeable Convex 



Non-inherently 
separable funs or 
funs with multiple 
local maximums 
or non-smooth 



Bridging can 
still applies 




bounded 

monotone 

functions 




specific funs] with 
s )ecific constraint 



Non- 
inherently 
separably 
or on- 
smooth 



complex numerical bridged to convex 

method duality method 



Application: Network utility 
maximization (NUM) 

• Problem formulation: v^ TT , N 

where n is a compact convex set, and £/;(•) are bridgeable. 
Physically, ^ can be link rate, net gain, cost etc., the 
constraint can be capacity constraint and other QoS 
constraint etc. All NUM type formulations have separable 
objectives 

• This problem is ready to be solved in a distributed way with 
the previous preparation 

• Except for the convergence gap due to the dual algorithm, 
there is no theoretical gap of the solution 




Comparison with existing method 



[Lee, Mazumdar, Shroff] 05, Non-Convex Optimization and 
Rate Control for Multi-Class Services in the Internet : 
distributed, but asymptotically optimal with increasing 
number of users, and only deal with sigmoid 

[Fazel, Chiang] 06, Network Utility Maximization With 
Non-concave Utilities Using Sum-of-Squares Method: 
consider polynomial (possibly multiple extremes) objective, 
but using convex relaxation methods (only achieves an 
upper bound, i.e., there is a theoretical gap), and 
centralized 



Can we do more: non-separable but 
convex 

• Bridging method can be applied to optimizations of 
bridgeable functions under convex constraint. Whether this 
method can be used for other functions? 

• All convex problems, no matter separable or not, can be 
viewed as a special case which is already bridged, i.e., no 
auxiliary variables 

• In most practical scenarios, we are interested in a compact 
set and functions with bounded derivative, so the 
boundedness is not a big issue 

• Difficulties: multiple local extremes and non-separable 



Can we do more: non-separable non- 
convex 

• A general version of the bridging theorem: If f(x) can be 
written as F(K - Q) , where the injection n x Ii K preserve 
convexity , F is continuously differentiable and concave, —f 
are not constant, and Q = <p(K) is an function satisfying 

r (K 1 -K 2 ) = 0(K 1 )-0(K 2 )yK v K 1 eU K (1) 
where 0 < y < 1 is some constant. Further, if there is a 
concave function g(Q) such that 

^F(0-\Q)-Q) + ^g(Q) = O 9 Vl (2) 

u • rs^ d Ql „ 1111 

then maximizing J \X) over convex U x can be bridged to 
a convex problem 

In general, it's extremely difficult to find a construction that satisfies both (1) & (2)1 




Can we do more: separable with 
multiple extremes 



Example: max f(x), where f(x) = x - log(x) 

0.5<x>1.5 

/' < 0, x € [0.5,1] ; / ' > 0, x € [1,1 .5] 

Let ^ = -Mx, Vx € [0,1] ; K = Mx, Vx € [1,+ao] , then 
for each set, the function is bridgeable. Solve for both sets 
and get the maximum 

The number of sets for L dimensional X is 2 (exponential) ! 



Can we do more: separable with 
multiple extreme s (Cont.) 

• The conditions in the general bridging theorem are difficult 
to verify, however, when the function is inherently separable, 
we only need to check the first derivative of each separating 
function <Pi. From the previous example, we can see that 
multiple extremes leads to unbridgeable due to not being 
monotone . 

• Without loss of generality, we consider a NUM 

L 

maxJ^/C*/) s.t.0< x t <c l9 [H]x<C 

x 1=1 

where [H] has nonnegative entries, e.g., routing matrix 
and C is component-wisely nonnegative, e.g., capacity 



Future work: Can we do more: 
separable with multiple extremes 
-J (Cont.) 

• U l (x t ) : bounded value and first derivative 

• Rough idea: from non-bridgeable to bridgeable 

Starting from zero, construct 
an increasing function U l 
that covers the original one; 

[H] has nonnegative entries, 
any maximum will not lie in 
the dotted area when using jj 
to run the optimization, this 
means we can optimize over {J 

0 c i x i 

• We construct increasing ( K l = Mx x , M >^0, [H ] still has 
nonnegative entries, and^/(^/) = K { — e Ul{Kl ' M) ) functions, 
but it is not differentiate 





Future work: Can we do more: 
separable with multiple extremes 
(Cont.) 



Rough idea of handling non differentiability: Let N t be the 
number of non-differentiable points of u l , then there are 

UN l non differentiate points of the sum objective and 

n(l + Af,) differentiate regions. At each region, do the 
bridging and run dual method. Recall that for the primal 
maximization over K l , by setting the first derivative zero, we 
getf l '(K l -c Pl (K l ))-^A k H kl =o. Note that //(•) = - and^ z (^ z ) is 
continuous over the whole constraint. This means the 
points around the non-differentiable points have consistent 
primal update equation. It is natural to expect that the 
potential function will be continuous and concave with UN l 
non-differentiable pts, and sub-gradient method may apply 




Application: Downlink power 
allocation under SINR model 



Same model as in [Lee, Mazumdar, Shroff] 05 , Downlink 
Power Allocation for Multi-class Wireless Systems 

N - NP 

max X U t (r,. (P)), s.t. 0 < P, < P T , ^ P, < P T ; Y, (P) = Q( y p ^ ) + ^ 

This is not separable, however, it is shown in the above 
paper that at the optimum, = P T 

Then under ^P t = P T , as long as the objective is bridgeable, 
we are done 1 

In general non-separable non-convex is extremely difficult, 
for specific problem, we can make non-separable problem 
into a separable format 




Application: Downlink power 
allocation under SINR model (Cont.) 



To be continue: how about there is linear cost term in the 
utility, then it is not true the optimum is obtained under 

Rough idea 

• Fix total power 

• Add back the term that captures the duality gap 

• Search through the total power-one dimension 



Future work: Can we do more: from 
non-separable to separable 



eneral non-separable power allocation formulation 
max £log(l + |l— ) - Cfi , s.t. [A]P < B,P lm <P t < P lM 



<T, + 



where [A], B, C 9 [H],ct are all nonnegative 
To be continued... 



Q&A 



